Robust and scalable content-and-structure indexing
نویسندگان
چکیده
Abstract Frequent queries on semi-structured hierarchical data are Content-and-Structure (CAS) that filter items based their location in the structure and value for some attribute. We propose Robust Scalable (RSCAS) index to efficiently answer CAS big data. To get an is robust against with varying selectivities, we introduce a novel dynamic interleaving merges path dimensions of composite keys balanced manner. store interleaved our trie-based RSCAS index, which supports wide range queries, including wildcards descendant axes. implement as log-structured merge tree scale it data-intensive applications high insertion rate. illustrate RSCAS’s robustness scalability by indexing from Software Heritage (SWH) archive, world’s largest, publicly available source code archive.
منابع مشابه
Robust Content-based Image Indexing
In this paper we present a robust information integration approach to identifying images of persons in large collections, such as the web. The underlying system relies on combining content analysis, which involves face detection and recognition, with context analysis which involves extraction of text or HTML features. Two aspects are explored to test the robustness of this approach: Sensitivity...
متن کاملContent-Scalable Analysis for Video Indexing and Retrieval
Video content-scalability for video indexing and retrieval is proposed. Recently, the demand for content-based multimedia applications is increasing even beyond the capabilities of best-effort transmission networks. Therefore, the trend is toward constructing a content-oriented multimedia server that is capable of handling high volumes of content as well as of fulfilling high performance and va...
متن کاملthe underlying structure of language proficiency and the proficiency level
هدف از انجام این تخقیق بررسی رابطه احتمالی بین سطح مهارت زبان خارجی (foreign language proficiency) و ساختار مهارت زبان خارجی بود. تعداد 314 زبان آموز مونث و مذکر که عمدتا دانشجویان رشته های زبان انگلیسی در سطوح کارشناسی و کارشناسی ارشد بودند در این تحقیق شرکت کردند. از لحاظ سطح مهارت زبان خارجی شرکت کنندگان بسیار با هم متفاوت بودند، (75 نفر سطح پیشرفته، 113 نفر سطح متوسط، 126 سطح مقدماتی). کلا ...
15 صفحه اولContent-based Watermarking for Indexing Using Robust Segmentation
In this paper, a novel approach to image indexing is presented using content-based watermarking. Some concepts associated with the application of watermarking to image indexing are discussed and a segmentation algorithm, appropriate for content-based watermarking, is presented. The segmentation algorithm is applied on reduced images and derives the exact same objects when performed on either th...
متن کاملShapeFit and ShapeKick for Robust, Scalable Structure from Motion
We introduce a new method for location recovery from pairwise directions that leverages an efficient convex program that comes with exact recovery guarantees, even in the presence of adversarial outliers. When pairwise directions represent scaled relative positions between pairs of views (estimated for instance with epipolar geometry) our method can be used for location recovery, that is the de...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Vldb Journal
سال: 2022
ISSN: ['0949-877X', '1066-8888']
DOI: https://doi.org/10.1007/s00778-022-00764-y